LTR_STRUC: a novel search and identification program for LTR retrotransposons

Authors

  • Eugene M. McCarthy
  • John F. McDonald
Abstract

Motivation: Long terminal repeat (LTR) retrotransposons constitute a substantial fraction of most eukaryotic genomes and are believed to have a significant impact on genome structure and function. Conventional methods used to search for LTR retrotransposons in genome databases are labor intensive. We present an efficient, reliable and automated method to identify and analyze members of this important class of transposable elements.

Results: We have developed a new data-mining program, LTR_STRUC (LTR retrotransposon structure program), which identifies and automatically analyzes LTR retrotransposons in genome databases by searching for structural features characteristic of such elements. LTR_STRUC has significant advantages over conventional search methods in the case of LTR retrotransposon families having low sequence homology to known queries or families with atypical structure (e.g. non-autonomous elements lacking canonical retroviral ORFs) and is thus a discovery tool that complements established methods. LTR_STRUC finds LTR retrotransposons using an algorithm that encompasses a number of tasks that would otherwise have to be initiated individually by the user. For each LTR retrotransposon found, LTR_STRUC automatically generates an analysis of a variety of structural features of biological interest.

Availability: The LTR_STRUC program is currently available as a console application free of charge to academic users from the authors.
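The abstract does not describe the LTR_STRUC algorithm itself, so the following is only a minimal illustrative sketch of what a structure-based search can look like, as opposed to a homology search against known queries. It looks for two identical copies of a sequence separated by a bounded distance, which is the most basic structural signature of an LTR retrotransposon. The function name `find_ltr_candidates`, its parameters, and the toy sequence are all hypothetical and are not taken from LTR_STRUC. Real LTR pairs usually differ by some mutations, which this exact-match sketch does not handle.

```python
from collections import defaultdict


def find_ltr_candidates(seq, k=8, min_ltr=12, min_dist=20, max_dist=200):
    """Return sorted (left_start, right_start, length) exact repeat pairs.

    Hypothetical sketch, not the LTR_STRUC algorithm: seeds on shared
    k-mers, then extends each seed while both copies stay identical.
    """
    seq = seq.upper()
    n = len(seq)

    # Index every k-mer by the positions at which it occurs.
    positions = defaultdict(list)
    for i in range(n - k + 1):
        positions[seq[i:i + k]].append(i)

    found = set()
    for hits in positions.values():
        for idx, a in enumerate(hits):
            for b in hits[idx + 1:]:
                dist = b - a
                if dist < min_dist or dist > max_dist:
                    continue
                # Extend the seed to the left while both copies agree.
                start_a, start_b = a, b
                while start_a > 0 and seq[start_a - 1] == seq[start_b - 1]:
                    start_a -= 1
                    start_b -= 1
                # Extend to the right; cap at dist so the copies never overlap.
                length = 0
                while (start_b + length < n and length < dist
                       and seq[start_a + length] == seq[start_b + length]):
                    length += 1
                if length >= min_ltr:
                    found.add((start_a, start_b, length))
    return sorted(found)


# Toy input: flank + LTR + internal region + LTR + flank (all made up).
ltr = "TGTTGGAATACCCTAGCA"
toy = "ACGTACGTAC" + ltr + "GGGGCCCCGGGGCCCCGGGGCCCCGGGGAAAT" + ltr + "TTTTTTTTTT"
print(find_ltr_candidates(toy))  # [(10, 60, 18)]
```

A search of this kind needs no library of known elements, which is why a structural approach can in principle recover families with low sequence homology to known queries.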


Similar resources

Mosquitoes LTR Retrotransposons: A Deeper View into the Genomic Sequence of Culex quinquefasciatus

A set of 67 novel LTR-retrotransposons has been identified by in silico analyses of the Culex quinquefasciatus genome using the LTR_STRUC program. The phylogenetic analysis shows that 29 novel and putatively functional LTR-retrotransposons detected belong to the Ty3/gypsy group. Our results demonstrate that, by considering only families containing potentially autonomous LTR-retrotransposons, the...


A Nest of LTR Retrotransposons Adjacent the Disease Resistance-Priming Gene NPR1 in Beta vulgaris L. U.S. Hybrid H20

A nest of long terminal repeat (LTR) retrotransposons (RTRs), discovered by LTR_STRUC analysis, is near core genes encoding the NPR1 disease resistance-activating factor and a heat-shock-factor-(HSF-) like protein in sugarbeet hybrid US H20. SCHULTE, a 10 833 bp LTR retrotransposon, with 1372 bp LTRs that are 0.7% divergent, has two ORFs with unexpected introns but encoding a reverse transcript...


Newly identified families of human endogenous retroviruses.

Human endogenous retroviruses (HERVs) make up approximately 8.3% of the human genome (12). HERVs have previously been classified into 31 distinct families based upon sequence alignment of reverse transcriptase (RT) and envelope domains and subsequent phylogenetic analyses (1, 9, 16). Using the data mining program LTR_STRUC (13) in conjunction with conventional sequence homology techniques, we r...


Getting an Evolutionary Handle on Life after Reproduction

Background: LTR retrotransposons are a class of mobile genetic elements containing two similar long terminal repeats (LTRs). Currently, LTR retrotransposons are annotated in eukaryotic genomes mainly through the conventional homology searching approach. Hence, it is limited to annotating known elements. Results: In this paper, we report a de novo computational method that can identify new LTR r...


MGEScan-non-LTR: computational identification and classification of autonomous non-LTR retrotransposons in eukaryotic genomes

Computational methods for genome-wide identification of mobile genetic elements (MGEs) have become increasingly necessary for both genome annotation and evolutionary studies. Non-long terminal repeat (non-LTR) retrotransposons are a class of MGEs that have been found in most eukaryotic genomes, sometimes in extremely high numbers. In this article, we present a computational tool, MGEScan-non-LT...



Journal:
  • Bioinformatics

Volume 19, Issue 3

Pages -

Publication date: 2003